Computational Tools and Resources for Linguistic Studies
نویسندگان
چکیده
First, a very useful searching engine, Key Word in Context (KWIC), is introduced. This tool can automatically extract linguistically significant patterns from large corpora and help linguists discover syntagmatic generalizations. Second, Dynamic Clustering and Hierarchical Clustering are introduced for identifying natural clusters of words or phrases in distribution. Third, statistical measures which could be used to measure the degree of cohesion and correlation among linguistic units are presented. These tools can help linguists identify the boundaries of lexical units. Fourth, alignment tools for aligning parallel texts at the word, sentence and structure levels are presented for linguists who do comparative studies of different languages. Fifth, we introduce Sequential Forward Selection (SFS) and Classification and Regression Tree (CART) for automatic rule ordering. Finally, some available electronic Chinese resources are described to provide reference purposes for those who are interested.
منابع مشابه
They Want To Eradicate the Nation: A Cross-Linguistic Study of the Attitudinal Language of Presidential Campaign Speeches in the USA and Iran
Politicians adopt a variety of linguistic strategies in their speeches to connect with their audience. To name one, appraisal, as a system of interpersonal meaning, is concerned with evaluation where resources are used for negotiating social relationships. Despite their significance in shaping texts, there have hardly been any extensive inventories of appraisal tools contrasting electoral speec...
متن کاملIntegration of an XML electronic dictionary with linguistic tools for natural language processing
This study proposes the codification of lexical information in electronic dictionaries, in accordance with a generic and extendable XML scheme model, and its conjunction with linguistic tools for the processing of natural language. Our approach is different from other similar studies in that we propose XML coding of those items from a dictionary of meanings that are less related to the lexical ...
متن کاملDistributing and Porting General Linguistic Tools
Our main motivation is to build general and adaptable linguistic tools and we have faced the problem of their portability. We first make a quick description of the linguistic tools we have at hand and we explain why linguistic tools, unlike other software tools, present pmticular portability problems. We then discuss code portability and also data portability and we describe the method we have ...
متن کاملUNITEX-PB, a set of flexible language resources for Brazilian Portuguese∗
This work documents the project and development of various computational linguistic resources that support the Brazilian Portuguese language according to the formal methodology used by the corpus processing system called UNITEX. The delivered resources include computational lexicons, libraries to access compressed lexicons, and additional tools to validate those resources.
متن کاملComputational Linguistics at Universiti Sains Malaysia
This paper gives a brief history of UTMK, a computer-aided translation unit, and reports on her projects and research co-operations. After its beginnings as a thesis project on Malay affixation, UTMK’s interest moved from machine translation to the development of tools for translation. Today, UTMK’s focus is on the development of natural language processing applications and tools (internet brow...
متن کاملWebLicht: Web-based LRT Services in a Distributed eScience Infrastructure
eScience enhanced science is a new paradigm of scientific work and research. In the humanities, eScience environments can be helpful in establishing new workflows and lifecycles of scientific data. WebLicht is such an eScience environment for linguistic analysis, making linguistic tools and resources available network-wide. Today, most digital language resources and tools (LRT) are available by...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- IJCLCLP
دوره 2 شماره
صفحات -
تاریخ انتشار 1997